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ARTICLE INFO ABSTRACT 








Central heating system faults affect building energy consumption and indoor thermal comfort significantly. The 
interdependencies among system components and multiple failure modes present a challenge for system health 
diagnostics and prognostics. A reliable diagnosis and prognosis can only be ensured when all component con- 
ditions are monitored with minimum uncertainty. In this regard, sensors should be selected based on their 
priority in providing system health information. Currently, most of the research on sensor optimization models 
optimize sensors position and orientation. However, in this study sensor type is optimized as well. In addition, 
the proposed method is based on the Bayesian network model and considers failure interdependencies of 
components, which leads to more realistic results. 

The objective of the optimization model is to obtain lowest entropy on system health information given 
financial and practical constraints. The proposed methodology is applied to a central heating system as a case 
study. The Bayesian Network model of the central heating system is constructed based on knowledge regarding 
the component interdependencies and conservation laws, as well as the gathered historical data. Results indicate 
that the Bayesian network model represents component interdependencies, and provides valuable information 
for sensor combination optimization. Sensors are optimized to obtain the minimum system information un- 
certainty. In addition, the sensitivity analysis on data generated by sensors demonstrates that the proposed 
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method diagnosis system faults correctly, and predicts system health status effectively. 





1. Introduction 
1.1. Overview 


The faulty operation of central heating system (CHS) equipment 
results in the energy loss and malfunction of other interconnected 
equipment in heating, ventilation, and air-conditioning systems. The 
central heating system faults arise from many different mechanical, 
operational, and environmental factors. The root cause diagnosis of 
system faults facilitates in improving system efficiency, reliability, and 
availability [1]. The sensor network provides valuable information on 
the system operation [2]. Sensors can be located at different places and 
can measure various parameters of the system. However, it is not 
possible to attach sensors on all streams of the system. Therefore, the 
effective specification and selection of sensor combination to support a 
reliable diagnosis of potentially critical failure modes is required [3]. In 
this study, an optimization algorithm for sensor combination selection 
is proposed. 
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In most research, sensor placement is optimized. This implies that 
the position and orientation of a specific sensor (thermometer, flow- 
meter,.) is optimized to obtain more information about the system 
considering multiple constraints. However, in sensor combination op- 
timization, various sensor types are compared and the best combination 
that can convey more information considering the constraints (in- 
cluding budget limit, weight allowance limit, etc.) is selected. Sensor 
combination optimization is a new concept that is applied on central 
heating systems in this study. 

Furthermore, it is critical to follow a diagnostic approach to detect 
system root cause faults and predict the remaining useful lifetime (RUL) 
of system components efficiently. System fault diagnostics was in- 
troduced in recent years for numerous applications such as aerospace, 
process control, automotive, and manufacturing [4—6]. Implementing a 
reliable fault diagnostic method for a CHS is vital to avoid unexpected 
faults and failures. Generally, it is estimated that fault diagnostic 
methods and other typical actions such as measurement and verifica- 
tion in buildings, can reduce up to 30% of CHS energy consumption 
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depending on the system status from the maintenance program, system 
age, and environmental standpoints [7]. 

In this study, a fault diagnostic methodology for CHSs is proposed to 
detect system failure and predict the RUL of system components with 
minimal uncertainty in real-time operation. In order to show how the 
proposed method should be implemented on different cases, it is ap- 
plied to a simple case study, and all the steps of the method is explained 
in detail. 


1.2. Literature review 


The problem of optimizing sensor placement has received con- 
siderable attention in recent years [8]. The available approaches of 
optimization models differ in their objective functions, assumptions 
regarding equation linearity, solution method, and how they handle 
uncertainty. 

For instance, Er et al. [9] investigated a sensor placement approach 
for machine monitoring to improve its efficiency. Because moving 
mechanical parts in a machine is the reason for vibrations affecting 
performance, it is necessary to monitor them. By placing minimal 
sensors on the key points of a machine, detecting an actual vibration 
spectrum is possible. Their results indicated that by using sensitivity 
calibration methods with the normalized Fisher information, the op- 
timal sensors are moderated sensitivity-calibrated sensors. In another 
study, Castro-Triguero et al. [10] determined the effect of parametric 
uncertainties on the optimal sensor placement of a truss bridge. Three 
variables including Young’s modulus, mass density, and cross-sectional 
dimensions had been chosen, and their effects on sensor placement 
were studied. Their result illustrated the considerable influence of 
sensor placement on the mentioned parameters. 

Unlike sensor placement optimization, research on sensor combi- 
nation optimization is scarce. In the work of Parhizkar et al. [11], a 
sensor combination in an oil pipeline network was optimized. In this 
study, various sensor types and models were compared, and the optimal 
combination that detects the failure modes (oil pipeline corrosion and 
leakage) with the highest reliability considering the budget limit was 
selected. 

After installing the optimal sensor combination on a system, sensor 
signal data are gathered. These data are converted to information and 
knowledge in a fault diagnosis model. System fault diagnostic models 
can be classified into two categories of quantitative and qualitative 
models, which are different based on the required knowledge for the 
formulation. The qualitative model is based on the observation of non- 
numerical data. The quantitative model emphasizes objective mea- 
surements and the statistical, mathematical, or numerical analysis of 
data collected through sensors and inspections [12]. 

The solution methodology of these models is based on supervised 
and unsupervised learning methods. Supervised learning is based on 
system historical data, and requires a priori knowledge of what the 
result should be. It is difficult and costly to acquire fault data from 
physical systems. To use this method in fault diagnosis models, training 
sets are typically generated based on a physical system model. Yan et al. 
[13] proposed a semi-supervised approach to detect faults of air 
handling units (AHUs). In this study, the tradeoff between the initial 
number of faulty samples and computational cost was studied. 

To handle priori unknown faults, unsupervised learning method is 
used. In this method, no trainer exists, and network is exposed to a 
number of inputs. Costa et al. [14] proposed an unsupervised method 
for the fault detection of industrial plants. In this study, the diagnosis 
process was based on a fuzzy-rule-based classifier called AutoClass. 

In central heating units, system failure modes are known, and the 
physical model of the system is available. Therefore, a quantitative 
model based on supervised learning methods can be used for system 
fault diagnosis. 

The supervised algorithms applicable in fault diagnosis includes the 
artificial neural network (ANN) with Levenberg—Marquardt (LM), 
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Bayesian Network (BN) model, Gaussian process regression, nonlinear 
autoregressive network with exogenous inputs with LM and BR, back- 
propagation, regression trees, and support vector regression [12,15,16]. 

For instance, Yildiz et al. [17] studied multiple methods for fore- 
casting electricity load faults in a building. Their results stated that the 
ANN with Bayesian regulation backpropagation exhibited the best 
overall root mean squared and mean absolute percentage error per- 
formance. Peitsman et al. [18] used the ANN method to detect and 
diagnose faults in an AHU system. The study proposed two stages of the 
ANN, one stage to detect abnormalities in the system, and the other 
stage to isolate the faults. Residuals of different parameters were used 
as inputs, and the trained network was used to diagnose the root cause 
of the faults. Lee et al. [19] developed a general regression neural 
network trained with normal operating data to predict model para- 
meters. The deviation of the actual data from the predicted value is 
considered as a sign of fault. Recently, Du et al. [20] developed an ANN 
model integrated with a wavelet and fractal data preprocessing tool. 
Data preprocessing tools were used to increase the efficiency of the 
ANN. 

As mentioned, in most of the reviewed literature, system un- 
certainty and components interactions were ignored. However, these 
factors affect system fault detection procedure significantly. Some 
methods of supervised learning can be used to consider data un- 
certainties. An excellent method is the Bayesian network that is used in 
different studies. Zhao et al. [21,22] implemented a dynamic Bayesian 
network (DBN) model to identify 28 typical faults of an AHU system. 
Their results indicated that the DBN method was effective in fault di- 
agnostic even when the diagnostic information was uncertain and in- 
complete. Widarsson et al. [23] developed a methodology using the BN 
method to warn the early leakage in a recovery boiler, and detect faults. 
Their results indicated that a normal load operation exhibits a leakage 
probability under 10%, and warning occurs if the leakage probability 
exceeds 10%. 

As shown in the literature review, most methods were based on 
supervised quantitative methods that required faulty datasets for di- 
agnostic. As the fault diagnostic process contains uncertainties, the 
reviewed literature proves that the BN method is a powerful method for 
fault diagnostic problems. Consequently, in this study, a supervised BN 
method was used for the fault diagnosis of CHSs. 


1.3. Research contributions 
The major contributions of this study are as follow. 


e A new sensor combination optimization method was investigated for 
central heating systems. This method optimizes the sensor type, in 
addition to its position and orientation that is considered in the 
reviewed literature. 

e The developed sensor combination optimization method considers 
the failure interdependencies of the component in the optimization 
procedure. Therefore, the optimal selected sensors are more realistic 
in comparison with other sensor selection optimization methodol- 
ogies that assume that the failure mechanisms of the system are 
independent. 

e A novel fault diagnostic methodology is proposed for CHSs. This 
approach diagnoses system faults including pipeline leakage, burner 
failure, pump failure, and thermal comfort failure in real time with 
minimal uncertainty. 

e The proposed methodology predicts the RUL of the CHS components 
in a real-time operation and predicts the system component reli- 
abilities over the operating lifetime. Real-time health prognostic is a 
new concept that is proposed in this research for CHSs. 


1.4. Paper outlines 


This paper is organized as follows: In Section 2, the overview of the 
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proposed methodology is presented. The system configuration and 
problem under study are explained in Section 3. The optimal sensor 
combination methodology and system fault diagnostic model are in- 
vestigated in Sections 4 and 5, respectively. The application of the 
developed methodology and results are presented in Section 6. In ad- 
dition, the effectiveness of the proposed method is discussed in this 
section. Finally, the study is concluded with an overall discussion on the 
advantage of the developed methodology in Section 7. 


2. Overview of the proposed methodology 


The BN method is an effective tool for modeling complex systems 
under uncertainty [24,25]. This method has been used since the 1980s 
[26,27] in uncertainty analysis and systems fault diagnostics. 

In fact, the BN is a type of probabilistic graphical model that is used 
to build a system model based on the expert analysis of system histor- 
ical data [28]. Such a process is illustrated by two general elements, i.e., 
structure and variables. The structural part of the BN is a graphical and 
qualitative model that constructs the chain between the modeled 
variables. Furthermore, BN variables are presented as BN nodes. The 
arcs among nodes represent the direct dependency of the variables. A 
simple BN is illustrated in Fig. 1. 

Here, X; are BN nodes that represent random variables. The statis- 
tical and probabilistic dependencies of the nodes are related and they 
affect each other through arcs. Nodes such as X, and X, are called 
parent or root nodes that do not have any inputs. Further, the other 
nodes such as X, is called the child. The full joint probability dis- 
tribution with random variables can be calculated simply by Eq. (1): 


k=4 
rx, | = JI Pex 
or (1) 


Generally, the BN describes P(X) in Eq. (1) as joint probabilities over 
a random variable where a; is all the parent or root nodes that are in- 
dependent, and k is the number of nodes. 

In this study, the sensor combination is optimized and system faults 
are diagnosed based on the system BN model. The flow diagram of the 
proposed approach is illustrated in Fig. 2. 

As depicted in Fig. 2, at the first step, the BN of the CHS is learned 
using the system historical data. In this step, the initial probability 
distribution of each node states is determined. Subsequently, the pos- 
sible nodes for sensor placement are identified. These variables are used 
in an optimization model, and the optimal sensor combination that 
minimizes system fault diagnostic uncertainties will be derived. This 
step is performed in the design phase of the system. The output of the 
optimization model determines the optimal sensor combination that 
should be placed in the system. 

The next step is the proposed fault diagnosis model. This step is 
performed continuously during the operation phase of the system. In 
this step, the system faults are diagnosed based on data from the 





Fig. 1. Sample of Bayesian network. 


Journal of Building Engineering 26 (2019) 100835 


optimal sensor configuration. Furthermore, the RULs for the primary 
components of the CHS are estimated in this step. 

The sensor combination optimization model and fault diagnosis 
model are based on the developed Bayesian network model at the first 
step. The BN model of the system is updated over the operating time; 
consequently, the system failure modes and RULs are presented in real 
time. 


3. System description 


A boiler is the primary part of the CHS; it is responsible for pro- 
viding hot water and/or the heating demand of a building. It consists of 
a closed vessel in which water and fuel are heated. The energy source of 
a boiler can be supplied from natural gas. Air and natural gas are 
burned in a burner of the boiler to heat up the circulating water. Hot 
water is pumped to the radiators. In the radiators, the hot water heat is 
transferred directly to a comfort zone. A simple schematic of a central 
heating unit is shown in Fig. 3. 

For fault diagnosis, it is necessary to monitor the operating signals 
of CHS streams. The primary operating conditions that can be mon- 
itored in a CHS are the comfort condition, mass flow rate, temperature 
of the radiator inlet water and mass flow rate, temperature of radiator 
outlet water, temperature of the boiler outlet and inlet water, burner 
status, mass flow rate of inlet air, mass flow rate of fuel consumption, 
and leakage of the system. It is noteworthy that these variables are 
correlated with mass and energy balance equations. Therefore, mon- 
itoring independent variables that can present stream states are suffi- 
cient for reliable system fault detection. The primary independent 
variables that can describe the CHS operating status are the ambient air 
temperature, return water temperature of boiler, outlet water tem- 
perature of boiler, fuel consumption, and flow rate of makeup water, 
which are illustrated in Fig. 3 with red dots. These variables (system 
operating conditions) can determine all the primary failure modes of 
the system. The primary failure modes of the system are pump failure, 
burner failure, pipeline leakage, and thermal comfort failure, all of 
which are considered in this study. The BN of the studied CHS that 
demonstrates the dependency of the failure modes to the mentioned 
operating conditions is presented in Fig. 4. This network is a general 
form of a BN for CHSs. The studied case, in this paper, considers one 
boiler, a pump and a radiator, and one thermal zone in the building. As 
a result, its BN is the same as the general form. However, in a case of 
multiple boilers, pumps and radiators, relevant intermediate nodes 
should be copied for each extra unit in the system. In addition, room 
temperature node should be duplicated for each extra thermal zone in 
the building as well. It should be noted that all the node connections 
would remain the same as the general form of the BN. 

Owing to budget limits, it is impossible to place sensors on all in- 
dependent operating condition variables. Consequently, the sensors 
should be prioritized based on the obtained information value of each. 
This procedure will be performed in the sensor combination optimiza- 
tion model. After sensor selection, the signals from sensors are mon- 
itored and analyzed in the fault diagnosis model to detect failure 
probabilities in the system. 


4. Sensor combination optimization 


In this study, a sensor combination optimization algorithm is pro- 
posed to diagnose system faults with minimal uncertainty considering 
the budget limits. A sensor signal detects an operating variable value. A 
combination of sensors detects multiple operating variables value. 
According to the system principles and the dependencies of failure 
modes to the operating variables, these sensor combinations contain 
different information value. Some combinations can convey more in- 
formation about the failure mode status in the system. Thus, it is im- 
portant to prioritize the installation of sensor combinations. The data 
flow algorithm of the sensor combination optimization is presented in 
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Fig. 2. Flow diagram of the proposed approach. 
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Fig. 3. Schematic diagram of a central heating system. 


Fig. 5. 

At the first step, the maximum number of sensor types (n) that can 
be placed in the system is determined. The maximum number of sensors 
in the studied CHS that can be placed in the ambient temperature, make 
up water flow rate (i.e. water supplied to compensate for loss by eva- 
poration and leakage), fuel flow rate, return water temperature, and 
outlet water temperature nodes is five. 

Subsequently, sensor i is selected randomly. This sensor is a node in 
the BN model with m states. For instance, the ambient thermometer 
contains an ambient temperature node in the BN with three states 
(Table 2). 

In this step, the state j of node i is selected as an evidence, and the 
probability distributions of all nodes will be updated based on the se- 
lected evidence. In other words, the state probability of all nodes are 
updated using BN principles considering the state j of node i as an 


evidence. 

Subsequently, the information entropy of all nodes are calculated. 
The information entropy quantifies the system information uncertainty. 
Generally, the larger an uncertainty is associated with a random vari- 
able, the larger is its information entropy. The information entropy of a 
BN node is calculated as Eq. (2). 


H(X) = — J, P(Xj)log,P Xj) 
j=1 (2) 


where P (X;j) is the probability mass function of the state j of node i, and 
b is the base of the logarithm. Its typical values are 2, e, and 10. The 
information entropy of a BN is the sum of the all nodes’ information 
entropy that is presented as a utility function in Eq. (3). 


U(X) =— YY (P(X;)log,P X) 
j=l jel (3) 


Ui (X) presents the utility function of the BN considering the evidence j 
of sensor i as an input. The utility function is calculated for all evidences 
(j = 1,...m) of sensor i. For instance, the ambient temperature node 
comprises three states; therefore, the utility function will be calculated 
thrice considering each state as an evidence. After calculating the utility 
function of all sensor evidences, the expected value of the utility 
function for sensor i is calculated. The expected value E(U) represents 
the mean value of the utility functions of sensor i, and is calculated by 
averaging over all possible outputs from the sensor evidences, as shown 
in Eq. (4). For the ambient temperature node, the probability of each 
state (T < 12, 12 < T < 21, 21 < T) can be derived from weather ob- 
servation stations. It should be noted that the classification of tem- 
perature range is based on the weather data report. As data show the 
average daily temperature of Tehran is between 12 to 21 °C, the am- 
bient temperature node is classified into three states of temperatures: 
lower than 12°C, between 12 and 21 °C, and higher than 21 °C. 
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Fig. 4. Bayesian network of a central heating system. 


E(U') = YUM x P(X) 
j=l (4) 


An information value index is introduced in this study to quantify 
the information value of the sensor signals. In other words, the model 
searches for a sensor that can reduce the information entropy of failure 
nodes maximally. The information value I is calculated from Eq. (5). 


E 

15l- Fe (5) 
where E* is the theoretical maximum expected value of the utility 
function. It represents the maximum entropy of the BN nodes that is 
calculated based on the prior probability distribution of BN nodes using 
Eq. (2). 

After calculating the I function for all nodes, the node with the 
maximum information value is selected as the best sensor, as shown in 


Eq. (6). 
Best Sensor = Max{I} (6) 


A sensor with maximum I is selected as an optimum sensor. At the 
next step, this process will be repeated for other available sensors in the 
combination of the selected one. In other words, all dual combinations 
of the selected sensor with other available ones are considered, and the 
information values are calculated. Subsequently, the dual combination 
with the maximum information value will be selected as the optimum 
combination. This process will be continued iteratively to select the 
next optimum sensor combination until the budget constraint is sa- 
tisfied. 


CKE (7) 


C* is the maximum available budget. This constraint represents 
sensors being added to the monitoring system until the sensor cost 
exceeds the available budget. 


5. Fault diagnostic model 


The fault diagnostic process identifies the system health status, and 
indicates the type of fault and its location. In this study, system faults 
are detected based on analyzing the sensor signals using the BN model 
of the system. 

The first step is data filtration. In this step, all sensor signals are 


processed and cleaned based on multiple logical constraints. One of the 
main constraints limits the range of sensor signals in each time interval. 
This range is determined based on sensor historical signals. This con- 
straint can help to reject signals from faulty sensors. 

After this step, all cleaned data are processed in the BN of the 
system. In the BN, sensor signals are considered as evidences of nodes. 
The information provided by the evidence updates the available 
knowledge of the nodes; thereafter, the posterior probabilities of the 
non-observed nodes will be achieved. This process is named BN in- 
ference. In a BN, the inference is based on the Bayesian theorem. 
Suppose that X1, X2, Xx are a set of random variables and satisfy the 
following condition: 


(i) P(X) > 0, i=1, 2, .., k; 
(ii) YAN, = S, S is a certain event; 
(ii) They are mutually exclusive [4]. 


For any given event (a), the posterior probability P(X;|a) is calcu- 
lated as Eq. (8). 
P(Xa) __  P(XÐ)P (aAX) 


P(X a) = a alla LA 
MI Di P WP aX) (8) 





The posterior probabilities of the BN nodes indicate the probability 
of node state occurrences. The node state can be the probability of a 
component failure. Therefore, by updating a BN based on the sensor 
signals, the probabilities of system component failures will be updated. 

In addition, the posterior probabilities of the BN can be used in 
predicting the RUL of the system components. This concept can be used 
as a method for the health prognostic of complex systems. As men- 
tioned, the BN model output is the system failure probabilities in real 
time. Mathematically, the reliability function R(t) represents the 
probability of survival and is calculated as Eq. (9). 


R(t) = 1 — F(t) = e” (9) 
where F(t)is the failure probability and A is the failure rate of the 
system’s components. Consequently, A can be calculated from Eq. (10), 
-TBR O _ -bl - FOI) 

KAN een t (10) 


Subsequently, the mean RUL of the components can be calculated as 
Eq. (11), [29]. 


A 
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Fig. 5. Data flow algorithm of sensor combination optimization. 


1 
RUL = — 
A (11) 


6. Results and discussion 
6.1. System description 


The considered CHS in this study is a typical heating unit in an 
official building located in Tehran, Iran. The structure of the BN is 
defined as sufficiently flexible to be adoptable for most heating unit 
systems in official and residential applications owing to the con- 
sideration of all possible faults. The BN model was learned based on the 
system historical data. 

As depicted in Fig. 6, five possible faults nodes and four potential 
nodes exist for the sensor placement in the system. The primary char- 
acteristics of the studied boiler, radiator, pump, and burner are pre- 
sented in Table 1, and more details of the system are provided in 
[30-33]. 

The BN of the studied CHS is illustrated in Fig. 4. The BN model was 
developed in the GeNle software. This software used for modeling and 
learning with Bayesian networks [34]. As illustrated, five possible 
failure detection nodes and four potential nodes exist for the sensor 
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placement. 

All sensor variables were discretized into several levels based on the 
average standard level of each variable. The states of the sensor nodes 
are presented in Table 2. Based on the weather data report, the average 
daily temperature of Tehran is between 12 to 21°C. The ambient 
temperature node is classified to three states of temperatures: lower 
than 12°C, between 12 and 21 °C, and higher than 21 °C. Moreover, the 
states of the makeup water flow rate node are defined as low and high 
levels. The low-level state refers to flow rates that are lower than the 
standard make up water flow rate, and the high level presents flow rates 
that are higher than the standard level. The states of fuel consumption, 
outlet water temperature, and return water temperature nodes are de- 
fined based on the component characteristics of the CHS presented in 
Table 1. 


6.2. Optimal sensor combination 


Through applying the methodology presented in Fig. 5, a sensor is 
first selected randomly and the information value of the sensor is cal- 
culated. In the case under study, there are five type of sensors, so each 
sensor is selected separately, and provided information from that sensor 
is calculated accordingly. The information values of all sensors are 
presented in Fig. 6. As shown, the information value has a low level for 
all sensor types as each sensor can provide limited information about 
system failure modes. In addition, it can be seen that the make up water 
flowmeter contains the most information value and is selected as the 
optimal first sensor among the others. 

The information value of the sensors is the conferred system failure 
node probabilities. Fig. 7 represents the distribution probabilities of the 
failure nodes considering different sensor combinations. As shown, the 
makeup water flowmeter is a productive sensor in fault detection, be- 
cause it can detect the fault of pipeline with a probability of 0.95, which 
is the greatest fault probability among the other sensors. It is note- 
worthy that a higher difference between two component failure status 
(yes and no) provides more certain information about the system based 
on the information theory. As shown in Fig. 7, the leakage fault prob- 
abilities in the pipeline can be detected with more certainty because of 
the large difference between the failure statuses; consequently, the 
sensor signal from the makeup water flowmeter is more valuable than 
the others. 

On the other hand, failure probability of 50% provides the least 
information about failure status as we have two states of yes and no 
failure. It can be seen that most of the failure probabilities in Fig. 7(e) is 
near 50%. This means that the selected sensor could not provide useful 
information about failure status of the system. This fact can be con- 
firmed by Fig. 6, as the related sensor (ambient temperature sensor) 
provides lowest information value in comparison to the other sensors. 

Following the algorithm, all information values gained from sensors 
are compared, and the sensor with the most information value is se- 
lected. In the studied case, make up water flow meter is selected as the 
first optimal sensor. 

According to the optimization algorithm (Fig. 5), at the next step, 
the model updates all posterior distributions based on the selected 
sensor, and the algorithm is repeated to select the next sensor. The 
utility function and expected value function are calculated using Eqs. 
(2-4) for the dual combinations of the selected sensors (make up water 
flowmeter with all the others). The results indicate that the make up 
water flowmeter and outlet water thermometer are the optimum dual 
configurations providing the highest information value in comparison 
with the others. 

By selecting the make up water flowmeter and outlet water ther- 
mometer combination as the optimum next input in the optimization 
algorithm, the selection of the third sensor is optimized. The informa- 
tion value of the triple combinations of sensors is calculated based on 
Eqs. (2-4). 

Fig. 8 illustrates the information value of single, dual, and triple 
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combination of all remaining sensors with two selected ones. As shown Outlet water temperature °c T < 60 60 < T < 80 T > 80 


in the figure, point 12 contains the maximum information value that is 
the triple combination of sensors including the make up water flow- 
meter, outlet water thermometer, and ambient thermometer. 

The increasing trend of information value in Fig. 8 indicates a 
meaningful relation between the combination of sensors and valuable 
detected information. As the number of sensors increases, the in- 
formation value and certainty regarding system failure status increase. 

As presented in Fig. 8, the make up water flowmeter, outlet water, 
and ambient thermometers create an optimum combination that yields 
high information value for condition monitoring analysis. Sensor se- 
lection optimization is stopped at this step owing to budget constraint. 
In other words, with the budget limit of C, the mentioned configuration 
provides the maximum information about system failure modes. 


6.3. Fault diagnostic 


In the previous section, the triple combination of sensors is selected 
as the optimum combination that transfers maximal information about 
the failure modes of the studied CHS. This configuration includes the 
make up water flowmeter, outlet water, and ambient thermometers, 
with 2, 3, and 3 node states in the BN model, respectively. Therefore, all 
possible sensor signal combinations for this configuration would be the 
18 scenarios presented in Table 3. 

For each possible scenario, the probability distributions of the 
failure nodes are calculated and the results are illustrated in Fig. 9. 

Fig. 9 conveys the applicable information for the operator regarding 
the status of the CHS components. For instance, in scenario 7, the sensor 


Table 1 

Technical characteristics of the studied CHS 
Equipment Manufacturer Model 
Boiler [27] Ferroli i29 


Radiator [28] Apollo radiator Monza aluminum 


Burner [29] Ecostar ECO1GC1 


Pump [30] Grundfos UPS2 


signals represent low ambient temperature, high outlet water tem- 
perature, and low make up water flow rate. At low ambient tempera- 
tures, the burner operates such that the outlet water temperature is as 
high as possible to satisfy thermal comfort. Because the sensor signal of 
the outlet water temperature is high, it can be inferred that the burner is 
operating properly. This is shown from the BN results of scenario 7, 
presented in Fig. 9. 

In addition, CHS pipelines operating in a safe mode demonstrate 
low make up water flow rates. As the make up water flowmeter signal 
exhibits a low state, it can be inferred that the pipelines are operating in 
a safe mode with no leakage. This is depicted in scenario 7, Fig. 9. 

The low ambient temperature indicates the necessity of operating 
the heating system at a full load to satisfy thermal comfort. A high 
outlet water temperature indicates that system operates at the full load 
and that thermal comfort is satisfied. 

To analyze the pump health status, the comparison between return 
water temperature and outlet water temperature should be performed. 
In the optimal selected sensors, the return water temperature sensor 
was not selected; consequently, the sensors did not provide any in- 
formation about this temperature. Therefore, the pump health status 
was performed with high uncertainty. 

The other scenarios are analogous and present the system compo- 
nent health status with different certainties. 


Capacity KW Characteristics 

7.9-27 Heat input (KW) 8.3-28 
Supply water temperature (°c) 60-80 

0.88 Stock code AL3C4H800 
Column 3 

10-50 Fan motor power (KW) 0.11 
Fuel consumption (m*/h) 1-5.2 

7-48 Flow rate (m°/h) 3.4 


Head (m) 6.2 
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Fig. 7. Probability distributions of failure nodes with (a): Return water thermometer, (b): Fuel consumption flowmeter, (c): Make up water flowmeter, (d): Outlet 


water thermometer, and (e): Ambient thermometer sensor signals. 


6.4. Remaining useful lifetime (RUL) 


In this section, the real-time RUL of the CHS components are pre- 
sented. The mean time between failure concepts was used to predict the 
RUL of the components, as was used in [35]. Different sensor signals 
result in different failure probabilities; consequently, different RUL 
components were obtained. Therefore, for all possible sensor signal 
combinations that are the 18 scenarios presented in Table. 3, the RUL of 
the CHS components are calculated based on Eqs. (9-11), and presented 
in Fig. 10. It is assumed that the signals were gathered after two years of 
system operation. 

Another important parameter for system analysis is the component 
reliability. The sensor signals can determine the system component’s 
reliability in real time, and the reliability can be predicted based on the 
system historical data (sensor signals). Fig. 11 presents the component’s 
reliability in real time, and their prediction for one of the operation 
scenarios (scenario 7). 

As mentioned previously, system status monitoring is performed 
after two years of operation. Therefore, the system components exhibit 
a different reliability at this stage. For instance, the pump exhibits a 
reliability of 0.8, and it highly affects the CHS’s overall reliability. As 
time passes, the component reliabilities decrease corresponding to the 
failure rates of the components. The reliability of the pipelines and 
burner decreases by approximately 5.2% after 5 years of operation. 


However, the pump reliability decreases significantly owing to its 
higher failure rate in comparison with other components. 


7. Conclusion 


In this paper, a BN-based method was proposed to optimize the 
sensor combination and diagnose the faults of CHSs. Sensor selection 
optimization is crucial in industrial systems. This paper proposes a new 
methodology for sensor combination optimization based on the in- 
formation value of sensor signals. The novelty of the methodology is the 
application of the information value using the BN such that sensors and 
measurement uncertainties are incorporated. The investigated metho- 
dology optimized the sensor combination of a CHS to maximize the 
system information value considering budget limits. 

In addition, the proposed methodology diagnosis system faults in- 
cluded failures in the pump, pipelines, and burner, and the thermal 
comfort with high accuracy. Furthermore, the reliability and RUL of the 
components were calculated in the real-time operation of the system. 

The developed methodology was applied to a typical heating unit in 
an official building. Its BN was learned using the experimental data 
tested in two seasons. The results indicated that the proposed metho- 
dology was effective in sensor combination optimization problems with 
multiple criteria that involved uncertainty. It should be noted that the 
results are for a simple case study, and the usefulness of the method is 
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Fig. 8. Information value of single, dual, and triple sensor configurations. 


Table 3 

Possible sensor signal combinations for optimal selected sensors 
Scenario Make up water Outlet water Ambient 

flowmeter thermometer thermometer 

1 Low(Q < 1) T < 60 T < 12 
2 Low (Q < 1) T < 60 12<T< 21 
3 Low(Q < 1) T < 60 T21 
4 Low(Q < 1) 60 < T < 80 T <12 
5 Low(Q < 1) 60 < T < 80 12<T< 21 
6 Low(Q < 1) 60 < T < 80 T >21 
7 Low(Q < 1) T > 80 T < 12 
8 Low(Q < 1) T > 80 12<T< 21 
9 Low(Q < 1) T > 80 T21 
10 High(Q > 1) T < 60 T<12 
11 High(Q > 1) T < 60 12 <7 < 21 
12 High(Q > 1) T < 60 T>21 
13 High(Q > 1) 60 < T < 80 T< 12 
14 High(Q > 1) 60 < T < 80 12.<T < 21 
15 High(Q > 1) 60 < T < 80 T>21 
16 High(Q > 1) T >80 T< 12 
17 High(Q > 1) T > 80 12<T<21 
18 High(Q > 1) T > 80 T >21 


more obvious in large CHSs as they have wider range of sensor com- 
bination alternatives and higher failure costs. 

Furthermore, the system component reliability and RUL were pre- 
sented for all possible operation modes in a real-time operation. It is 
noteworthy that predicting the real-time reliability and RUL of the 
components is challenging using the conventional fault diagnosis 
methods. However, our results indicated that this method is robust in 
fault diagnostics and RUL prediction. The proposed methodology can be 
applied to any complex system to select the optimum sensor combi- 
nation, and detect system faults and RUL with minimal uncertainty. 


Nomenclature 
Variables 


Independent nodes a; 

Number of nodes k 

Maximum number of sensor types n 
Sensor node/number of random variables i 
State of node i j 

Base of the logarithm b 

Temperature T 

Theoretical maximum expected value of the utility function E* 
Random variables X 

Failure rate of system’s component A 

Flow rate m 

Make up water flow rate Q 

Parameter 


Joint probabilities over a random variable P(X) 
Information entropy of a BN node H(X) 

Probability mass function of the state j of node i P(Xij) 
Utility function U(X) 

Utility function of the BN considering the evidence j of sensor i as an 
input U(Xij) 

Expected value E(U) 

Available budget C 

Maximum available budget C* 

Certain event S 

Posterior probability P(X;|a) 

PRobability of survival R(t) 

Failure probability F(t) 

Remaining useful life RUL 


T. Parhizkar, et al. Journal of Building Engineering 26 (2019) 100835 


Scenario 1 Scenario 2 Scenario 3 





Pump Burner Pipeline Thermal Pump Burner Pipeline Thermal Pump Burner Pipeline . Thermal 
failure failure leakage comfort failure failure leakage comfort failure failure leakage comfort 
failure failure failure 


Scenario 4 Scenario 5 Scenario 6 





Pump Burner Pipeline Thermal Pump Burner Pipeline Thermal Pump Burner Pipeline Thermal 
failure failure leakage comfort failure failure leakage comfort failure failure leakage comfort 
failure failure failure 


Scenario 7 Scenario 8 Scenario 9 








Pump Burner Pipeline Thermal Pump Burner Pipeline Thermal Pump Burner Pipeline Thermal 
failure failure leakage comfort failure failure leakage comfort failure failure leakage comfort 
failure failure failure 


Scenario 10 Scenario 11 Scenario 12 








Pump Burner Pipeline Thermal Pump Burner Pipeline Thermal Pump Burner Pipeline Thermal 
failure failure leakage comfort failure failure leakage comfort failure failure leakage comfort 
failure failure failure 


Scenario 13 Scenario 14 Scenario 15 





Pump Burner Pipeline Thermal Pump Burner Pipeline Thermal Pump Burner Pipeline Thermal 
failure failure leakage comfort failure failure leakage comfort failure failure leakage comfort 
failure failure failure 


Scenario 16 Scenario 17 Scenario 18 





Pump Burner Pipesline Thermal Pump Burner Pipesline Thermal Pump Burner Pipesline Thermal 
failure failure leakage comfort failure failure leakage comfort failure failure leakage comfort 
failure failure failure 


Component failure status MM) No HMM Yes 


Fig. 9. Probability distributions of failure nodes for 18 scenarios of operation. 
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Fig. 10. RUL of the pump, burner, and CHS after two years of operation for 18 scenarios. 
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Fig. 11. Reliability prediction of CHS components for scenario 7. 
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